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Amendments to the Specification 



In the BRIEF DESCRIPTION OF THE DRAWINGS on page 3, please replace the 
description of Figure 2 as follows. Deleted text is indicated by strikeout font and added text 
is indicated by underlining. 



Figure 2 shows one e mbodimen t Figures 2A and 2B show embodiments of 
a method of selecting a speech recognizer using performance prediction, in 
accordance with the invention.— 



Please replace the last paragraph on page 3, beginning with "The recognizers, 14a- 
14n" as follows. Deleted text is indicated by strikeout font and added text is indicated by 
underlining. 

-- The recognizers, 1 4a 1 4n 14a2l4g, receive the input stream, it must be 
noted that although it appears that there are 14 recognizers A-N a-M . the use of the 
letter N n is in the mathematical sense, where N n is the number of recognizers 
decided upon by the system designer. In some embodiments* the predictor 22 may 
control the routing of the input stream to the recognizers. The recognizers receiving 
the input stream and converting it will be referred to as the enabled recognizers. The 
enabling of the recognizers by the predictor will be discussed in more detail below.-- 




Please replace the first full paragraph on page 4> beginning with "The enabled 
recognizers perform," as follows. Deleted text is indicated by strikeout font and added text is 
indicated by underlining, 

-The enabled recognizers perform the speech recognition tasks. The output 
of the enabled recognizers would then be sent to an output switch 16. The predictor 
then selects a set of results 20 from the results presented to the output switch 16. The 
basis of that selection is discussed in more detail below.— 
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Please replace the second full paragraph on page 4, beginning with **One embodiment 
of performing," as follows. Deleted text is indicated by strikeout font and added text is 

indicated by underlining . — 

—One embodiment of performing the selection of the appropriate recognizer is 
shown in Figure 2 2A. As discussed above with regard to Figure 1, an input stream is 
received at 30. The input stream carries the speech input, as well other sounds and 
audio cues that may be used in deriving information pertinent to the speech 
recognition task.-- 

^ ■ ■ 

Please replace the third full paragraph on page 4, beginning with "For example, at 
32," as follows. Deleted text is indicated by strikeout font and added text is indicated by 

underlining. ^ 

■•n — For example, at 32, the input stream may be analyzed to determine 

characteristics of the communication channel. In some systems, analyzers exist that 
allow the system to determine the audio characteristics of the channel, for example, 
determining if the cellular or landline communication networks are in use. Other 
information may also be derived, including background noise and signal strength 
among others. Additionally, this analysis may determine characteristics of the 
communication device, for example determining if a speaker-phone or a wireless 
handset are in use/This analysis may occur as part of deriving enabling information 
for the recognizers. Network-based information services such as CallerlD in 
conjunction with a local or network-based database mapping calling number to 
channel and device characteristics may be utilized for similar effect- 



Please replace the first full paragraph on page 9, beginning with "In one embodiment 
of the invention " as follows. Deleted text is indicated by strikeout font and added text is 
indicated by underlining. 

-In one embodiment of the invention, the flow shown in Figur e 2 as shown in 
Figure 2A is modified to chang e s to include 40 and 42 , as shown in Figure 2B . The 
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input stream is routed to the enabled recognizers in 40. The enabled recognizer 
provides results and associated recognizer information that may include individual 
result confidence values, for example, to the predictor at 42. The predictor then uses 
the recognizer information as part of the selection information in 34.-- 



Please replace the first full paragraph on page 11, beginning with "Generally, the 
selection information/' as follows. Deleted text is indicated by strikeout font and added text 
is indicated by underlining. 

-Generally, the selection information with regard to Recognizer A will be 
updated, and the updated selection information will cause a different recognizer to be 
selected than was used in previous interactions. An interaction could be an utterance 
within a session. For example, the user makes several utterances separated by silence. 
For the first few utterances, the system may use Recognizer B 5 as Recognizer A 
adapts. For the remaining utterances, the system may switch to Recognizer A, not the 
recognizer used for the previous utterances. Alternatively, an interaction could be an 
entire session where the user enters a speech recognition process and then ends it, 
such as dialing into a telephone-based system, and ending by hanging up. -. — In this 
example, Recognizer B may be used for a first few sessions with a particular user, 
while Recognizer A is adapting, and then the system may select Recognizer A after it 
has adapted. — 

Please replace the first full paragraph on page 12, beginning with "Referring to Figure 
2," as follows. Deleted text is indicated by strikeout font and added text is indicated by 
underlining, 




-- Referring to Figuro2 both Figures 2A and 2B> after the results are returned 
to (he application in block 38 . feedback may be generated at 44, as an option. This 
feedback may then be used to update information about the recognizers at 46, such as 
their performance with a given characteristics in the contextual information, a 
weighting that is applied to their individual-result confidence values, their 
optimization for various channels and devices, among many others. This is optional. 
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This updated information may also be used when deriving the enabling information 
32_or the selection information 34* or both. The feedback may be generated in several 
ways, and at several different times. For example, feedback generated off-line will 
not occur at the place in the process shown here, but much later. In some systems, the 
user may provide the feedback, directly indicating the recognition accuracy. This may 
occur, for example, in dictation applications. The predictor may track the user's 
corrections and use that information to tune the prediction process. In many 
applications, this is too cumbersome to be practical. A variety of indirect 
measurement methods are discussed below.— 
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